Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 440099 |
| Missing cells | 645659 |
| Missing cells (%) | 10.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 50.4 MiB |
| Average record size in memory | 120.0 B |
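As a quick consistency check, the missing-cell percentage can be recomputed from the other figures in this table. The sketch below only uses numbers copied from the table; the variable names are illustrative.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 14
n_observations = 440_099
missing_cells = 645_659

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```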
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 6 |
Transaction ID is highly correlated with Date of Travel | High correlation |
Date of Travel is highly correlated with Transaction ID | High correlation |
KM Travelled is highly correlated with Price Charged and 1 other fields | High correlation |
Price Charged is highly correlated with KM Travelled and 1 other fields | High correlation |
Cost of Trip is highly correlated with KM Travelled and 1 other fields | High correlation |
Customer ID is highly correlated with Population and 2 other fields | High correlation |
Population is highly correlated with Customer ID and 2 other fields | High correlation |
City is highly correlated with Customer ID and 2 other fields | High correlation |
Users is highly correlated with Customer ID and 2 other fields | High correlation |
Population is highly correlated with City and 1 other fields | High correlation |
City is highly correlated with Population and 1 other fields | High correlation |
Users is highly correlated with Population and 1 other fields | High correlation |
Date of Travel has 80707 (18.3%) missing values | Missing |
Company has 80707 (18.3%) missing values | Missing |
City has 80706 (18.3%) missing values | Missing |
KM Travelled has 80707 (18.3%) missing values | Missing |
Price Charged has 80707 (18.3%) missing values | Missing |
Cost of Trip has 80707 (18.3%) missing values | Missing |
Population has 80706 (18.3%) missing values | Missing |
Users has 80706 (18.3%) missing values | Missing |
Transaction ID is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2021-06-28 05:57:46.076348 |
|---|---|
| Analysis finished | 2021-06-28 05:59:23.552770 |
| Duration | 1 minute and 37.48 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
Transaction ID
Real number (ℝ≥0)
Alerts: High correlation, Uniform
| Distinct | 440098 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10220059.5 |
| Minimum | 10000011 |
|---|---|
| Maximum | 10440108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 10000011 |
|---|---|
| 5-th percentile | 10022015.85 |
| Q1 | 10110035.25 |
| median | 10220059.5 |
| Q3 | 10330083.75 |
| 95-th percentile | 10418103.15 |
| Maximum | 10440108 |
| Range | 440097 |
| Interquartile range (IQR) | 220048.5 |
Descriptive statistics
| Standard deviation | 127045.4937 |
|---|---|
| Coefficient of variation (CV) | 0.01243099355 |
| Kurtosis | -1.2 |
| Mean | 10220059.5 |
| Median Absolute Deviation (MAD) | 110024.5 |
| Skewness | 1.475091901 × 10⁻¹⁷ |
| Sum | 4.497827746 × 10¹² |
| Variance | 1.614055748 × 10¹⁰ |
| Monotonicity | Not monotonic |
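The quantities in the quantile and descriptive tables can be reproduced with the standard library. This is a minimal sketch on a small illustrative sample, not the report's own implementation; it assumes the sample standard deviation (n - 1 denominator) and the "inclusive" quantile method, which may differ from what the profiling tool uses.

```python
import statistics


def describe(values):
    """Return a few summary statistics for a list of numbers."""
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (assumption)
    median = statistics.median(values)
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    # Median absolute deviation: median of |x - median(x)|.
    mad = statistics.median([abs(v - median) for v in values])
    return {
        "mean": mean,
        "std": std,
        "cv": std / mean,  # coefficient of variation
        "range": max(values) - min(values),
        "iqr": q3 - q1,
        "mad": mad,
    }


# Illustrative data: consecutive integers, similar in shape to an ID column.
sample = list(range(1, 10))
summary = describe(sample)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

For evenly spaced values such as these, the mean equals the median and the MAD is half the IQR, which matches the pattern in the Transaction ID figures.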
| Value | Count | Frequency (%) |
| 10088472 | 1 | < 0.1% |
| 10248195 | 1 | < 0.1% |
| 10145945 | 1 | < 0.1% |
| 10361010 | 1 | < 0.1% |
| 10344253 | 1 | < 0.1% |
| 10016543 | 1 | < 0.1% |
| 10174158 | 1 | < 0.1% |
| 10256637 | 1 | < 0.1% |
| 10284931 | 1 | < 0.1% |
| 10184716 | 1 | < 0.1% |
| Other values (440088) | 440088 |
| Value | Count | Frequency (%) |
| 10000011 | 1 | |
| 10000012 | 1 | |
| 10000013 | 1 | |
| 10000014 | 1 | |
| 10000015 | 1 | |
| 10000016 | 1 | |
| 10000017 | 1 | |
| 10000018 | 1 | |
| 10000019 | 1 | |
| 10000020 | 1 |
| Value | Count | Frequency (%) |
| 10440108 | 1 | |
| 10440107 | 1 | |
| 10440106 | 1 | |
| 10440105 | 1 | |
| 10440104 | 1 | |
| 10440103 | 1 | |
| 10440102 | 1 | |
| 10440101 | 1 | |
| 10440100 | 1 | |
| 10440099 | 1 |
Date of Travel
Real number (ℝ≥0)
Alerts: High correlation, Missing
| Distinct | 1095 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 80707 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42964.068 |
| Minimum | 42371 |
|---|---|
| Maximum | 43465 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 42371 |
|---|---|
| 5-th percentile | 42465 |
| Q1 | 42697 |
| median | 42988 |
| Q3 | 43232 |
| 95-th percentile | 43429 |
| Maximum | 43465 |
| Range | 1094 |
| Interquartile range (IQR) | 535 |
Descriptive statistics
| Standard deviation | 307.467197 |
|---|---|
| Coefficient of variation (CV) | 0.007156380002 |
| Kurtosis | -1.137362913 |
| Mean | 42964.068 |
| Median Absolute Deviation (MAD) | 273 |
| Skewness | -0.06800364811 |
| Sum | 1.544094233 × 10¹⁰ |
| Variance | 94536.07725 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43105 | 2022 | 0.5% |
| 43084 | 1123 | 0.3% |
| 43077 | 1100 | 0.2% |
| 43449 | 1086 | 0.2% |
| 43063 | 1085 | 0.2% |
| 43456 | 1084 | 0.2% |
| 43448 | 1076 | 0.2% |
| 43091 | 1042 | 0.2% |
| 43428 | 1037 | 0.2% |
| 43079 | 1032 | 0.2% |
| Other values (1085) | 347705 | |
| (Missing) | 80707 | 18.3% |
| Value | Count | Frequency (%) |
| 42371 | 181 | |
| 42372 | 178 | |
| 42373 | 25 | < 0.1% |
| 42374 | 47 | < 0.1% |
| 42375 | 109 | < 0.1% |
| 42376 | 141 | |
| 42377 | 111 | < 0.1% |
| 42378 | 289 | |
| 42379 | 272 | |
| 42380 | 85 | < 0.1% |
| Value | Count | Frequency (%) |
| 43465 | 256 | 0.1% |
| 43464 | 257 | 0.1% |
| 43463 | 825 | |
| 43462 | 843 | |
| 43461 | 318 | 0.1% |
| 43460 | 270 | 0.1% |
| 43459 | 284 | 0.1% |
| 43458 | 279 | 0.1% |
| 43457 | 339 | 0.1% |
| 43456 | 1084 |
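The report profiles Date of Travel as a real number. One plausible reading, which the report itself does not confirm, is that the values are spreadsheet serial day numbers. Under that assumption (1900 date system, day 0 at 1899-12-30), the minimum and maximum can be converted to calendar dates as sketched below; `serial_to_date` is a hypothetical helper.

```python
from datetime import date, timedelta

# Assumption: values are Excel-style serial day numbers (1900 date system),
# for which serial 0 corresponds to 1899-12-30.
EXCEL_EPOCH = date(1899, 12, 30)


def serial_to_date(serial):
    """Convert a serial day number to a calendar date (hypothetical helper)."""
    return EXCEL_EPOCH + timedelta(days=int(serial))


first_day = serial_to_date(42371)  # reported minimum
last_day = serial_to_date(43465)   # reported maximum
span_days = (last_day - first_day).days + 1

print(first_day.isoformat(), last_day.isoformat(), span_days)
```

If the assumption holds, the span in days equals the reported number of distinct values, meaning every day in the range appears at least once.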
Company
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80707 |
| Missing (%) | 18.3% |
| Memory size | 6.7 MiB |
| Yellow Cab | |
|---|---|
| Pink Cab |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.528587169 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3424498 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pink Cab |
|---|---|
| 2nd row | Yellow Cab |
| 3rd row | Yellow Cab |
| 4th row | Pink Cab |
| 5th row | Yellow Cab |
Common Values
| Value | Count | Frequency (%) |
| Yellow Cab | 274681 | 62.4% |
| Pink Cab | 84711 | 19.2% |
| (Missing) | 80707 | 18.3% |
Words
| Value | Count | Frequency (%) |
| cab | 359392 | |
| yellow | 274681 | |
| pink | 84711 | 11.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 549362 | |
| (space) | 359392 | |
| C | 359392 | |
| a | 359392 | |
| b | 359392 | |
| Y | 274681 | |
| e | 274681 | |
| o | 274681 | |
| w | 274681 | |
| P | 84711 | 2.5% |
| Other values (3) | 254133 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2346322 | |
| Uppercase Letter | 718784 | 21.0% |
| Space Separator | 359392 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 549362 | |
| a | 359392 | |
| b | 359392 | |
| e | 274681 | |
| o | 274681 | |
| w | 274681 | |
| i | 84711 | 3.6% |
| n | 84711 | 3.6% |
| k | 84711 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 359392 | |
| Y | 274681 | |
| P | 84711 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 359392 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3065106 | |
| Common | 359392 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 549362 | |
| C | 359392 | |
| a | 359392 | |
| b | 359392 | |
| Y | 274681 | |
| e | 274681 | |
| o | 274681 | |
| w | 274681 | |
| P | 84711 | 2.8% |
| i | 84711 | 2.8% |
| Other values (2) | 169422 | 5.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 359392 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3424498 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 549362 | |
| (space) | 359392 | |
| C | 359392 | |
| a | 359392 | |
| b | 359392 | |
| Y | 274681 | |
| e | 274681 | |
| o | 274681 | |
| w | 274681 | |
| P | 84711 | 2.5% |
| Other values (3) | 254133 |
City
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80706 |
| Missing (%) | 18.3% |
| Memory size | 6.7 MiB |
| NEW YORK NY | |
|---|---|
| CHICAGO IL | |
| LOS ANGELES CA | |
| WASHINGTON DC | |
| BOSTON MA | |
| Other values (15) |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.29947717 |
| Min length | 8 |
Characters and Unicode
| Total characters | 4060953 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ATLANTA GA |
|---|---|
| 2nd row | ATLANTA GA |
| 3rd row | ATLANTA GA |
| 4th row | ATLANTA GA |
| 5th row | ATLANTA GA |
Common Values
| Value | Count | Frequency (%) |
| NEW YORK NY | 99885 | |
| CHICAGO IL | 56625 | |
| LOS ANGELES CA | 48033 | |
| WASHINGTON DC | 43737 | |
| BOSTON MA | 29692 | 6.7% |
| SAN DIEGO CA | 20488 | 4.7% |
| SILICON VALLEY | 8519 | 1.9% |
| SEATTLE WA | 7997 | 1.8% |
| ATLANTA GA | 7557 | 1.7% |
| DALLAS TX | 7017 | 1.6% |
| Other values (10) | 29843 | 6.8% |
| (Missing) | 80706 |
Words
| Value | Count | Frequency (%) |
| new | 99885 | |
| ny | 99885 | |
| york | 99885 | |
| ca | 70889 | 8.0% |
| il | 56625 | 6.4% |
| chicago | 56625 | 6.4% |
| angeles | 48033 | 5.4% |
| los | 48033 | 5.4% |
| dc | 43737 | 4.9% |
| washington | 43737 | 4.9% |
| Other values (28) | 219859 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 527800 | |
| N | 430602 | |
| A | 366625 | 9.0% |
| O | 354823 | 8.7% |
| E | 260025 | 6.4% |
| C | 248502 | 6.1% |
| S | 227035 | 5.6% |
| L | 220310 | 5.4% |
| I | 218705 | 5.4% |
| Y | 212271 | 5.2% |
| Other values (15) | 994255 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3533153 | |
| Space Separator | 527800 | 13.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 430602 | |
| A | 366625 | |
| O | 354823 | |
| E | 260025 | 7.4% |
| C | 248502 | 7.0% |
| S | 227035 | 6.4% |
| L | 220310 | 6.2% |
| I | 218705 | 6.2% |
| Y | 212271 | 6.0% |
| G | 181735 | 5.1% |
| Other values (14) | 812520 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 527800 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3533153 | |
| Common | 527800 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 430602 | |
| A | 366625 | |
| O | 354823 | |
| E | 260025 | 7.4% |
| C | 248502 | 7.0% |
| S | 227035 | 6.4% |
| L | 220310 | 6.2% |
| I | 218705 | 6.2% |
| Y | 212271 | 6.0% |
| G | 181735 | 5.1% |
| Other values (14) | 812520 |
Common
| Value | Count | Frequency (%) |
| (space) | 527800 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4060953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 527800 | |
| N | 430602 | |
| A | 366625 | 9.0% |
| O | 354823 | 8.7% |
| E | 260025 | 6.4% |
| C | 248502 | 6.1% |
| S | 227035 | 5.6% |
| L | 220310 | 5.4% |
| I | 218705 | 5.4% |
| Y | 212271 | 5.2% |
| Other values (15) | 994255 |
KM Travelled
Real number (ℝ≥0)
Alerts: High correlation, Missing
| Distinct | 874 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 80707 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.56725408 |
| Minimum | 1.9 |
|---|---|
| Maximum | 48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 1.9 |
|---|---|
| 5-th percentile | 3.57 |
| Q1 | 12 |
| median | 22.44 |
| Q3 | 32.96 |
| 95-th percentile | 42 |
| Maximum | 48 |
| Range | 46.1 |
| Interquartile range (IQR) | 20.96 |
Descriptive statistics
| Standard deviation | 12.23352593 |
|---|---|
| Coefficient of variation (CV) | 0.5420919125 |
| Kurtosis | -1.126875356 |
| Mean | 22.56725408 |
| Median Absolute Deviation (MAD) | 10.45 |
| Skewness | 0.05577890774 |
| Sum | 8110490.58 |
| Variance | 149.6591566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33.6 | 1536 | 0.3% |
| 24 | 1080 | 0.2% |
| 22.8 | 1075 | 0.2% |
| 35.7 | 1069 | 0.2% |
| 16.8 | 1065 | 0.2% |
| 37.44 | 1062 | 0.2% |
| 39.6 | 1056 | 0.2% |
| 28.08 | 972 | 0.2% |
| 21.85 | 769 | 0.2% |
| 18 | 754 | 0.2% |
| Other values (864) | 348954 | |
| (Missing) | 80707 | 18.3% |
| Value | Count | Frequency (%) |
| 1.9 | 339 | |
| 1.92 | 375 | |
| 1.94 | 329 | |
| 1.96 | 383 | |
| 1.98 | 374 | |
| 2 | 362 | |
| 2.02 | 341 | |
| 2.04 | 358 | |
| 2.06 | 346 | |
| 2.08 | 369 |
| Value | Count | Frequency (%) |
| 48 | 366 | |
| 47.6 | 381 | |
| 47.2 | 378 | |
| 46.8 | 737 | |
| 46.41 | 380 | |
| 46.4 | 356 | |
| 46.02 | 385 | |
| 46 | 336 | |
| 45.63 | 344 | |
| 45.6 | 704 |
Price Charged
Real number (ℝ≥0)
Alerts: High correlation, Missing
| Distinct | 99176 |
|---|---|
| Distinct (%) | 27.6% |
| Missing | 80707 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 423.4433113 |
| Minimum | 15.6 |
|---|---|
| Maximum | 2048.03 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 15.6 |
|---|---|
| 5-th percentile | 63.42 |
| Q1 | 206.4375 |
| median | 386.36 |
| Q3 | 583.66 |
| 95-th percentile | 944.89 |
| Maximum | 2048.03 |
| Range | 2032.43 |
| Interquartile range (IQR) | 377.2225 |
Descriptive statistics
| Standard deviation | 274.3789114 |
|---|---|
| Coefficient of variation (CV) | 0.6479708243 |
| Kurtosis | 0.7476354732 |
| Mean | 423.4433113 |
| Median Absolute Deviation (MAD) | 187.22 |
| Skewness | 0.8737614916 |
| Sum | 152182138.5 |
| Variance | 75283.78705 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 191.27 | 18 | < 0.1% |
| 298.32 | 18 | < 0.1% |
| 216.37 | 17 | < 0.1% |
| 198.8 | 17 | < 0.1% |
| 181.59 | 17 | < 0.1% |
| 115.53 | 17 | < 0.1% |
| 79.38 | 16 | < 0.1% |
| 260.09 | 15 | < 0.1% |
| 367.68 | 15 | < 0.1% |
| 264.83 | 15 | < 0.1% |
| Other values (99166) | 359227 | |
| (Missing) | 80707 | 18.3% |
| Value | Count | Frequency (%) |
| 15.6 | 1 | |
| 15.75 | 1 | |
| 16.38 | 1 | |
| 16.53 | 1 | |
| 16.76 | 1 | |
| 17.03 | 1 | |
| 17.11 | 1 | |
| 17.21 | 1 | |
| 17.27 | 1 | |
| 17.46 | 1 |
| Value | Count | Frequency (%) |
| 2048.03 | 1 | |
| 2016.7 | 1 | |
| 2013.95 | 1 | |
| 1993.83 | 1 | |
| 1981.05 | 1 | |
| 1978.79 | 1 | |
| 1957.1 | 1 | |
| 1947.91 | 1 | |
| 1925.92 | 1 | |
| 1920.59 | 1 |
Cost of Trip
Real number (ℝ≥0)
Alerts: High correlation, Missing
| Distinct | 16291 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 80707 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 286.1901128 |
| Minimum | 19 |
|---|---|
| Maximum | 691.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 46.224 |
| Q1 | 151.2 |
| median | 282.48 |
| Q3 | 413.6832 |
| 95-th percentile | 544.3632 |
| Maximum | 691.2 |
| Range | 672.2 |
| Interquartile range (IQR) | 262.4832 |
Descriptive statistics
| Standard deviation | 157.9936612 |
|---|---|
| Coefficient of variation (CV) | 0.5520584188 |
| Kurtosis | -1.012232752 |
| Mean | 286.1901128 |
| Median Absolute Deviation (MAD) | 131.232 |
| Skewness | 0.1379580609 |
| Sum | 102854437 |
| Variance | 24961.99696 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 362.88 | 186 | < 0.1% |
| 479.808 | 184 | < 0.1% |
| 471.744 | 180 | < 0.1% |
| 205.632 | 178 | < 0.1% |
| 411.264 | 166 | < 0.1% |
| 336.96 | 166 | < 0.1% |
| 428.4 | 164 | < 0.1% |
| 423.36 | 161 | < 0.1% |
| 241.92 | 161 | < 0.1% |
| 443.52 | 160 | < 0.1% |
| Other values (16281) | 357686 | |
| (Missing) | 80707 | 18.3% |
| Value | Count | Frequency (%) |
| 19 | 2 | |
| 19.19 | 4 | |
| 19.2 | 4 | |
| 19.38 | 2 | |
| 19.392 | 1 | < 0.1% |
| 19.4 | 3 | |
| 19.57 | 3 | |
| 19.584 | 2 | |
| 19.594 | 3 | |
| 19.6 | 3 |
| Value | Count | Frequency (%) |
| 691.2 | 9 | < 0.1% |
| 685.44 | 29 | |
| 679.728 | 14 | < 0.1% |
| 679.68 | 33 | |
| 674.016 | 34 | |
| 673.92 | 36 | |
| 668.352 | 15 | < 0.1% |
| 668.304 | 49 | |
| 668.16 | 21 | |
| 662.7348 | 19 | < 0.1% |
Customer ID
Real number (ℝ≥0)
| Distinct | 49171 |
|---|---|
| Distinct (%) | 11.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23619.51312 |
| Minimum | 1 |
|---|---|
| Maximum | 60000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 664 |
| Q1 | 3530 |
| median | 15168 |
| Q3 | 43884 |
| 95-th percentile | 57784 |
| Maximum | 60000 |
| Range | 59999 |
| Interquartile range (IQR) | 40354 |
Descriptive statistics
| Standard deviation | 21195.54982 |
|---|---|
| Coefficient of variation (CV) | 0.897374544 |
| Kurtosis | -1.560810014 |
| Mean | 23619.51312 |
| Median Absolute Deviation (MAD) | 14002 |
| Skewness | 0.341134246 |
| Sum | 1.039490048 × 10¹⁰ |
| Variance | 449251332 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 494 | 54 | < 0.1% |
| 2939 | 53 | < 0.1% |
| 2766 | 51 | < 0.1% |
| 1070 | 51 | < 0.1% |
| 2539 | 50 | < 0.1% |
| 903 | 50 | < 0.1% |
| 1803 | 50 | < 0.1% |
| 944 | 50 | < 0.1% |
| 1067 | 50 | < 0.1% |
| 858 | 50 | < 0.1% |
| Other values (49161) | 439589 |
| Value | Count | Frequency (%) |
| 1 | 29 | |
| 2 | 40 | |
| 3 | 46 | |
| 4 | 26 | |
| 5 | 31 | |
| 6 | 28 | |
| 7 | 36 | |
| 8 | 35 | |
| 9 | 40 | |
| 10 | 24 |
| Value | Count | Frequency (%) |
| 60000 | 18 | |
| 59999 | 8 | |
| 59998 | 9 | |
| 59997 | 10 | |
| 59996 | 4 | < 0.1% |
| 59995 | 13 | |
| 59994 | 13 | |
| 59993 | 13 | |
| 59992 | 11 | |
| 59991 | 9 |
Payment_Mode
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.7 MiB |
| Card | |
|---|---|
| Cash |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1760392 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Card |
|---|---|
| 2nd row | Cash |
| 3rd row | Card |
| 4th row | Card |
| 5th row | Card |
Common Values
| Value | Count | Frequency (%) |
| Card | 263991 | 60.0% |
| Cash | 176107 | 40.0% |
| (Missing) | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| card | 263991 | |
| cash | 176107 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 440098 | |
| a | 440098 | |
| r | 263991 | |
| d | 263991 | |
| s | 176107 | |
| h | 176107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1320294 | |
| Uppercase Letter | 440098 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 440098 | |
| r | 263991 | |
| d | 263991 | |
| s | 176107 | |
| h | 176107 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 440098 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1760392 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 440098 | |
| a | 440098 | |
| r | 263991 | |
| d | 263991 | |
| s | 176107 | |
| h | 176107 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1760392 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 440098 | |
| a | 440098 | |
| r | 263991 | |
| d | 263991 | |
| s | 176107 | |
| h | 176107 |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 6.7 MiB |
| Male | |
|---|---|
| Female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.833846098 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2127366 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Male | 256611 | 58.3% |
| Female | 183487 | 41.7% |
| (Missing) | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| male | 256611 | |
| female | 183487 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 623585 | |
| a | 440098 | |
| l | 440098 | |
| M | 256611 | |
| F | 183487 | 8.6% |
| m | 183487 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1687268 | |
| Uppercase Letter | 440098 | 20.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 623585 | |
| a | 440098 | |
| l | 440098 | |
| m | 183487 | 10.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 256611 | |
| F | 183487 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2127366 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 623585 | |
| a | 440098 | |
| l | 440098 | |
| M | 256611 | |
| F | 183487 | 8.6% |
| m | 183487 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2127366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 623585 | |
| a | 440098 | |
| l | 440098 | |
| M | 256611 | |
| F | 183487 | 8.6% |
| m | 183487 | 8.6% |
Age
Real number (ℝ≥0)
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.36019705 |
| Minimum | 18 |
|---|---|
| Maximum | 65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 25 |
| median | 33 |
| Q3 | 42 |
| 95-th percentile | 61 |
| Maximum | 65 |
| Range | 47 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.58266805 |
|---|---|
| Coefficient of variation (CV) | 0.3558427016 |
| Kurtosis | -0.4603646408 |
| Mean | 35.36019705 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.6819975877 |
| Sum | 15561952 |
| Variance | 158.3235351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 15005 | 3.4% |
| 20 | 14997 | 3.4% |
| 39 | 14591 | 3.3% |
| 32 | 14431 | 3.3% |
| 25 | 14407 | 3.3% |
| 22 | 14334 | 3.3% |
| 30 | 14287 | 3.2% |
| 27 | 14235 | 3.2% |
| 40 | 14210 | 3.2% |
| 26 | 14125 | 3.2% |
| Other values (38) | 295476 |
| Value | Count | Frequency (%) |
| 18 | 13572 | |
| 19 | 13941 | |
| 20 | 14997 | |
| 21 | 13507 | |
| 22 | 14334 | |
| 23 | 15005 | |
| 24 | 13916 | |
| 25 | 14407 | |
| 26 | 14125 | |
| 27 | 14235 |
| Value | Count | Frequency (%) |
| 65 | 4065 | |
| 64 | 4760 | |
| 63 | 4532 | |
| 62 | 4458 | |
| 61 | 5207 | |
| 60 | 4597 | |
| 59 | 4978 | |
| 58 | 4995 | |
| 57 | 4380 | |
| 56 | 4583 |
Income (USD/Month)
Real number (ℝ≥0)
| Distinct | 23341 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15092.18199 |
| Minimum | 2000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.7 MiB |
Quantile statistics
| Minimum | 2000 |
|---|---|
| 5-th percentile | 3244 |
| Q1 | 8391 |
| median | 14767 |
| Q3 | 21084 |
| 95-th percentile | 29659 |
| Maximum | 35000 |
| Range | 33000 |
| Interquartile range (IQR) | 12693 |
Descriptive statistics
| Standard deviation | 7987.309505 |
|---|---|
| Coefficient of variation (CV) | 0.5292349052 |
| Kurtosis | -0.6753031806 |
| Mean | 15092.18199 |
| Median Absolute Deviation (MAD) | 6337 |
| Skewness | 0.2999555469 |
| Sum | 6642039109 |
| Variance | 63797113.13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20884 | 165 | < 0.1% |
| 8756 | 135 | < 0.1% |
| 8899 | 133 | < 0.1% |
| 6808 | 133 | < 0.1% |
| 17468 | 131 | < 0.1% |
| 24826 | 130 | < 0.1% |
| 22525 | 129 | < 0.1% |
| 3878 | 126 | < 0.1% |
| 7070 | 123 | < 0.1% |
| 8518 | 122 | < 0.1% |
| Other values (23331) | 438771 |
| Value | Count | Frequency (%) |
| 2000 | 9 | < 0.1% |
| 2001 | 1 | < 0.1% |
| 2002 | 2 | < 0.1% |
| 2003 | 9 | < 0.1% |
| 2004 | 6 | < 0.1% |
| 2007 | 27 | < 0.1% |
| 2009 | 2 | < 0.1% |
| 2010 | 104 | |
| 2011 | 10 | < 0.1% |
| 2012 | 75 |
| Value | Count | Frequency (%) |
| 35000 | 1 | < 0.1% |
| 34996 | 15 | |
| 34995 | 4 | < 0.1% |
| 34989 | 30 | |
| 34985 | 16 | |
| 34984 | 23 | |
| 34983 | 3 | < 0.1% |
| 34979 | 1 | < 0.1% |
| 34977 | 2 | < 0.1% |
| 34973 | 2 | < 0.1% |
Population
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80706 |
| Missing (%) | 18.3% |
| Memory size | 6.7 MiB |
| 8,405,837 | |
|---|---|
| 1,955,130 | |
| 1,595,037 | |
| 418,859 | |
| 248,968 | |
| Other values (15) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.24375266 |
| Min length | 9 |
Characters and Unicode
| Total characters | 3681533 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 814,885 |
|---|---|
| 2nd row | 814,885 |
| 3rd row | 814,885 |
| 4th row | 814,885 |
| 5th row | 814,885 |
Common Values
| Value | Count | Frequency (%) |
| 8,405,837 | 99885 | |
| 1,955,130 | 56625 | |
| 1,595,037 | 48033 | |
| 418,859 | 43737 | |
| 248,968 | 29692 | 6.7% |
| 959,307 | 20488 | 4.7% |
| 1,177,609 | 8519 | 1.9% |
| 671,238 | 7997 | 1.8% |
| 814,885 | 7557 | 1.7% |
| 942,908 | 7017 | 1.6% |
| Other values (10) | 29843 | 6.8% |
| (Missing) | 80706 |
Words
| Value | Count | Frequency (%) |
| 8,405,837 | 99885 | |
| 1,955,130 | 56625 | |
| 1,595,037 | 48033 | |
| 418,859 | 43737 | |
| 248,968 | 29692 | 8.3% |
| 959,307 | 20488 | 5.7% |
| 1,177,609 | 8519 | 2.4% |
| 671,238 | 7997 | 2.2% |
| 814,885 | 7557 | 2.1% |
| 942,908 | 7017 | 2.0% |
| Other values (10) | 29843 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 718786 | |
| , | 582891 | |
| 5 | 412069 | |
| 8 | 394504 | |
| 3 | 269469 | 7.3% |
| 1 | 265312 | 7.2% |
| 9 | 261224 | 7.1% |
| 0 | 249844 | 6.8% |
| 7 | 209906 | 5.7% |
| 4 | 201319 | 5.5% |
| Other values (2) | 116209 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2379856 | |
| Space Separator | 718786 | 19.5% |
| Other Punctuation | 582891 | 15.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 412069 | |
| 8 | 394504 | |
| 3 | 269469 | |
| 1 | 265312 | |
| 9 | 261224 | |
| 0 | 249844 | |
| 7 | 209906 | |
| 4 | 201319 | |
| 2 | 60806 | 2.6% |
| 6 | 55403 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 718786 | |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 582891 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3681533 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 718786 | |
| , | 582891 | |
| 5 | 412069 | |
| 8 | 394504 | |
| 3 | 269469 | 7.3% |
| 1 | 265312 | 7.2% |
| 9 | 261224 | 7.1% |
| 0 | 249844 | 6.8% |
| 7 | 209906 | 5.7% |
| 4 | 201319 | 5.5% |
| Other values (2) | 116209 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3681533 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 718786 | |
| , | 582891 | |
| 5 | 412069 | |
| 8 | 394504 | |
| 3 | 269469 | 7.3% |
| 1 | 265312 | 7.2% |
| 9 | 261224 | 7.1% |
| 0 | 249844 | 6.8% |
| 7 | 209906 | 5.7% |
| 4 | 201319 | 5.5% |
| Other values (2) | 116209 | 3.2% |
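Population is profiled as categorical because its values are text with thousands separators (for example 814,885), and the space counts above suggest the strings are also padded with whitespace. A minimal sketch of converting such strings to integers follows; `parse_count` is a hypothetical helper, and the padded inputs are an assumption about the raw data.

```python
def parse_count(text):
    """Convert a string such as ' 8,405,837 ' to an integer (hypothetical helper)."""
    # Drop surrounding whitespace, then remove thousands separators.
    return int(text.strip().replace(",", ""))


# Values taken from the Common Values table; the padding is assumed.
raw_values = [" 8,405,837 ", " 1,955,130 ", " 814,885 "]
populations = [parse_count(v) for v in raw_values]

print(populations)
```

Once the column is numeric, the profiler would report quantile and descriptive statistics for it instead of character-level counts.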
Users
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80706 |
| Missing (%) | 18.3% |
| Memory size | 6.7 MiB |
| 302,149 | |
|---|---|
| 164,468 | |
| 144,132 | |
| 127,001 | |
| 80,021 | |
| Other values (15) |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.661103583 |
| Min length | 7 |
Characters and Unicode
| Total characters | 3112740 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 24,701 |
|---|---|
| 2nd row | 24,701 |
| 3rd row | 24,701 |
| 4th row | 24,701 |
| 5th row | 24,701 |
Common Values
| Value | Count | Frequency (%) |
| 302,149 | 99885 | |
| 164,468 | 56625 | |
| 144,132 | 48033 | |
| 127,001 | 43737 | |
| 80,021 | 29692 | 6.7% |
| 69,995 | 20488 | 4.7% |
| 27,247 | 8519 | 1.9% |
| 25,063 | 7997 | 1.8% |
| 24,701 | 7557 | 1.7% |
| 22,157 | 7017 | 1.6% |
| Other values (10) | 29843 | 6.8% |
| (Missing) | 80706 |
Words
| Value | Count | Frequency (%) |
| 302,149 | 99885 | |
| 164,468 | 56625 | |
| 144,132 | 48033 | |
| 127,001 | 43737 | |
| 80,021 | 29692 | 8.3% |
| 69,995 | 20488 | 5.7% |
| 27,247 | 8519 | 2.4% |
| 25,063 | 7997 | 2.2% |
| 24,701 | 7557 | 2.1% |
| 22,157 | 7017 | 2.0% |
| Other values (10) | 29843 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 718786 | |
| 1 | 411294 | |
| , | 359393 | |
| 4 | 344027 | |
| 2 | 284547 | 9.1% |
| 0 | 267675 | 8.6% |
| 9 | 177220 | 5.7% |
| 3 | 162670 | 5.2% |
| 6 | 151567 | 4.9% |
| 7 | 100461 | 3.2% |
| Other values (2) | 135100 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2034561 | |
| Space Separator | 718786 | 23.1% |
| Other Punctuation | 359393 | 11.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 411294 | |
| 4 | 344027 | |
| 2 | 284547 | |
| 0 | 267675 | |
| 9 | 177220 | |
| 3 | 162670 | 8.0% |
| 6 | 151567 | 7.4% |
| 7 | 100461 | 4.9% |
| 8 | 91213 | 4.5% |
| 5 | 43887 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 718786 | |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 359393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3112740 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 718786 | |
| 1 | 411294 | |
| , | 359393 | |
| 4 | 344027 | |
| 2 | 284547 | 9.1% |
| 0 | 267675 | 8.6% |
| 9 | 177220 | 5.7% |
| 3 | 162670 | 5.2% |
| 6 | 151567 | 4.9% |
| 7 | 100461 | 3.2% |
| Other values (2) | 135100 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 3112740 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 718786 | 23.1% |
| 1 | 411294 | 13.2% |
| , | 359393 | 11.5% |
| 4 | 344027 | 11.1% |
| 2 | 284547 | 9.1% |
| 0 | 267675 | 8.6% |
| 9 | 177220 | 5.7% |
| 3 | 162670 | 5.2% |
| 6 | 151567 | 4.9% |
| 7 | 100461 | 3.2% |
| Other values (2) | 135100 | 4.3% |
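The character statistics above show one comma per non-missing value (359393 commas for 359393 values) plus surrounding spaces, which is presumably why this count-like field is profiled as text rather than as a number. A minimal sketch of a cleaning step follows; the helper name `parse_count` and the decision to map empty or missing input to `None` are assumptions for illustration, not part of the report.

```python
def parse_count(raw):
    """Convert a string such as ' 24,701 ' to the integer 24701.

    Returns None for missing or empty input. Any other non-numeric
    text raises ValueError, so bad values are not silently hidden.
    """
    if raw is None:
        return None
    # Drop surrounding whitespace and the thousands separators.
    text = raw.strip().replace(",", "")
    if not text:
        return None
    return int(text)

# Values taken from the Common Values table, with assumed padding.
raw_values = [" 24,701 ", "302,149", None]
cleaned = [parse_count(v) for v in raw_values]
```

After a conversion of this kind, the field could be profiled as a numeric variable instead of a categorical one.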
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
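As a minimal sketch of that calculation, the snippet below computes r in plain Python as the covariance divided by the product of the standard deviations (the common 1/n factors cancel, so sums of deviations are used directly). The function name `pearson_r` is illustrative, and the sample values are the KM Travelled and Cost of Trip figures from the first five rows of this report.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson's r: covariance of xs and ys over the product of their
    standard deviations. Raises ZeroDivisionError if either input is
    constant, because its standard deviation is then zero."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations; the 1/n factors cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

km = [30.45, 26.19, 42.55, 28.62, 36.38]
cost = [313.6350, 317.4228, 597.4020, 334.8540, 467.1192]
r = pearson_r(km, cost)

# Invariance under location and scale: rescaling km leaves r unchanged.
r_rescaled = pearson_r([2 * x + 3 for x in km], cost)
```

Five rows are far too few to say anything about the full dataset; the sample only illustrates the arithmetic.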
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
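The rank-based definition can be sketched in plain Python as follows: each variable is replaced by its ranks, and the covariance of the ranks is divided by the product of their standard deviations. Giving tied values the average of their positions is an assumption of this sketch (it is a common convention), and the function names are illustrative.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(xs, ys):
    """Covariance of the rank variables over the product of their std devs."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)

# A nonlinear but strictly increasing relationship is perfectly monotonic.
xs = [1, 2, 3, 4, 5]
rho = spearman_rho(xs, [x ** 3 for x in xs])
```

In this example the relationship is not linear, yet the ranks of the two variables agree exactly, which is the case ρ is designed to capture.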
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
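The pair-counting definition above corresponds to the variant usually called tau-a, sketched below in plain Python. The function name is illustrative, and this is not necessarily what the report computes: statistical libraries often default to tau-b, which additionally corrects for ties, so results on tied data can differ from this sketch.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total pairs, as described in the text.

    Pairs tied in x or y count as neither concordant nor discordant.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Three observations give three pairs: two concordant, one discordant.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
```

The double loop over pairs is quadratic in the number of observations, so this form is only practical for small samples.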
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Transaction ID | Date of Travel | Company | City | KM Travelled | Price Charged | Cost of Trip | Customer ID | Payment_Mode | Gender | Age | Income (USD/Month) | Population | Users |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000011.0 | 42377.0 | Pink Cab | ATLANTA GA | 30.45 | 370.95 | 313.6350 | 29290.0 | Card | Male | 28.0 | 10813.0 | 814,885 | 24,701 |
| 1 | 10351127.0 | 43302.0 | Yellow Cab | ATLANTA GA | 26.19 | 598.70 | 317.4228 | 29290.0 | Cash | Male | 28.0 | 10813.0 | 814,885 | 24,701 |
| 2 | 10412921.0 | 43427.0 | Yellow Cab | ATLANTA GA | 42.55 | 792.05 | 597.4020 | 29290.0 | Card | Male | 28.0 | 10813.0 | 814,885 | 24,701 |
| 3 | 10000012.0 | 42375.0 | Pink Cab | ATLANTA GA | 28.62 | 358.52 | 334.8540 | 27703.0 | Card | Male | 27.0 | 9237.0 | 814,885 | 24,701 |
| 4 | 10320494.0 | 43211.0 | Yellow Cab | ATLANTA GA | 36.38 | 721.10 | 467.1192 | 27703.0 | Card | Male | 27.0 | 9237.0 | 814,885 | 24,701 |
| 5 | 10324737.0 | 43224.0 | Yellow Cab | ATLANTA GA | 6.18 | 138.40 | 87.5088 | 27703.0 | Cash | Male | 27.0 | 9237.0 | 814,885 | 24,701 |
| 6 | 10395626.0 | 43400.0 | Pink Cab | ATLANTA GA | 13.39 | 167.03 | 141.9340 | 27703.0 | Card | Male | 27.0 | 9237.0 | 814,885 | 24,701 |
| 7 | 10000013.0 | 42371.0 | Pink Cab | ATLANTA GA | 9.04 | 125.20 | 97.6320 | 28712.0 | Cash | Male | 53.0 | 11242.0 | 814,885 | 24,701 |
| 8 | 10079404.0 | 42634.0 | Yellow Cab | ATLANTA GA | 39.60 | 704.30 | 494.2080 | 28712.0 | Card | Male | 53.0 | 11242.0 | 814,885 | 24,701 |
| 9 | 10186994.0 | 42909.0 | Yellow Cab | ATLANTA GA | 18.19 | 365.63 | 246.6564 | 28712.0 | Card | Male | 53.0 | 11242.0 | 814,885 | 24,701 |
Last rows
| | Transaction ID | Date of Travel | Company | City | KM Travelled | Price Charged | Cost of Trip | Customer ID | Payment_Mode | Gender | Age | Income (USD/Month) | Population | Users |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 440089 | 10274704.0 | 43075.0 | Yellow Cab | WASHINGTON DC | 42.80 | 627.21 | 559.8240 | 52614.0 | Card | Female | 44.0 | 8303.0 | 418,859 | 127,001 |
| 440090 | 10311299.0 | 43174.0 | Yellow Cab | WASHINGTON DC | 13.56 | 241.43 | 165.9744 | 52614.0 | Card | Female | 44.0 | 8303.0 | 418,859 | 127,001 |
| 440091 | 10439949.0 | 43102.0 | Yellow Cab | WASHINGTON DC | 34.80 | 507.12 | 484.4160 | 52614.0 | Cash | Female | 44.0 | 8303.0 | 418,859 | 127,001 |
| 440092 | 10284072.0 | 43086.0 | Yellow Cab | WASHINGTON DC | 44.00 | 679.97 | 607.2000 | 51406.0 | Cash | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440093 | 10307228.0 | 43162.0 | Yellow Cab | WASHINGTON DC | 38.40 | 668.93 | 525.3120 | 51406.0 | Cash | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440094 | 10319775.0 | 43203.0 | Yellow Cab | WASHINGTON DC | 3.57 | 67.60 | 44.5536 | 51406.0 | Cash | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440095 | 10347676.0 | 43287.0 | Yellow Cab | WASHINGTON DC | 23.46 | 331.97 | 337.8240 | 51406.0 | Card | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440096 | 10358624.0 | 43314.0 | Yellow Cab | WASHINGTON DC | 27.60 | 358.23 | 364.3200 | 51406.0 | Cash | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440097 | 10370709.0 | 43342.0 | Yellow Cab | WASHINGTON DC | 34.24 | 453.11 | 427.3152 | 51406.0 | Card | Female | 29.0 | 6829.0 | 418,859 | 127,001 |
| 440098 | NaN | NaN | NaN | SAN FRANCISCO CA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 629,591 | 213,609 |